BOOTSTRAP
Abstract. Rao, Pathak and Koltchinskii (1997) have recently studied a sequential
approach to resampling in which resampling is carried out sequentially one-by-one
(with replacement each time) until the bootstrap sample contains $m \approx (1 - e^{-1})n \approx 0.632n$
distinct observations from the original sample. In our previous work, we have
established that the main empirical characteristics of the sequential bootstrap go
through, in the sense of being within a distance of order $O(n^{-3/4})$ from those
of the usual bootstrap. However, the theoretical justification of the second order
correctness of the sequential bootstrap is somewhat difficult. It is the main topic
of this investigation. Among other things, we accomplish it by approximating our
sequential scheme by a resampling scheme based on the Poisson distribution with
mean $\mu = 1$ and censored at $X = 0$.


1.  Introduction 


Efron (1979) introduced the bootstrap method of resampling as a ubiquitous sampling
technique for estimating the variance of an estimator and the sampling distribution
of a given statistic. In a fundamental paper, Bhattacharya and Ghosh (1978) have
demonstrated that Edgeworth expansions for a wide class of statistics can be derived
from Edgeworth expansions for multivariate sample means. This technique has been
used by Singh (1981) to show, in the case of the univariate sample mean, that the
bootstrap is more accurate than the central limit theorem when higher order population
moments exist. These ideas are further exploited by Babu and Singh (1983, 1984)
to show the superiority of the bootstrap method, and by Babu and Singh (1985) to
obtain Edgeworth expansions for the ratio statistic and similar statistics based on
samples from finite populations. The method is also used by Babu and Singh (1989)
to obtain global Edgeworth expansions for functions of means of random vectors,
when one of the coordinates has a lattice distribution and the remaining part of the
vector has a strongly non-lattice distribution. Later, Giné and Zinn (1990) showed
that in a certain weak sense, the bootstrap method is valid (consistent) if and only if
the central limit theorem holds. In fact the central limit theorem furnishes accuracy
of approximation of order $o(1)$, while if the third population moment exists, one can
expect, in many commonly encountered populations, the accuracy of the bootstrap
method to be of order $o(n^{-1/2})$, where $n$ denotes the sample size. Thus while the
bootstrap method has the potential of being second-order accurate, the central limit
approximation is not so. This is one of the several reasons for the current interest
and preference in the literature for those methods of resampling that are second-order
accurate, i.e., accurate of the order $o(n^{-1/2})$.

Stemming from Efron's observation that the information content of a bootstrap
sample is based on approximately $(1 - e^{-1})100\% \approx 63\%$ of the original sample, Rao,
Pathak and Koltchinskii (1997) have introduced a sequential resampling method in
which sampling is carried out one-by-one (with replacement) until $(m + 1)$ distinct
original observations appear, where $m$ denotes the largest integer not exceeding
$(1 - e^{-1})n$. It has been shown that the empirical characteristics of this sequential bootstrap
are within a distance of order $O(n^{-3/4})$ from the usual bootstrap. The authors provide
a heuristic argument in favor of their sampling scheme and establish the consistency
of the sequential bootstrap; however, the question of second-order correctness was not
addressed.

The main object of this paper is to examine the second order correctness of the
sequential bootstrap. The theoretical justification of this is somewhat more difficult
because of the dependence among the bootstrap sample units. At this time a rigorous
Edgeworth expansion under this kind of dependence is unavailable in the literature.

A cumbersome approach based on computation of cumulants, under the (unsubstantiated)
assumption that a formal Edgeworth expansion is valid, may be given along the
lines of Hall and Mammen (1994). This does not lead to a complete solution as the
Edgeworth expansions are not known. Instead we first approximate the sequential
bootstrap by another sequential resampling scheme based on the Poisson distribution.
Under the new scheme the "independence" of sample units under resampling
is preserved. A rigorous justification of the Edgeworth expansion can now be given
more easily. This then provides a sound theoretical framework in which the second
order correctness for the sequential bootstrap can be established. In this paper we
provide details for the sample mean. Edgeworth expansions for statistics that can
be represented as a smooth function of a multivariate sample mean are considered in
Section 5.


2.  Sequential  Resampling  Scheme 

Let $S = (X_1, X_2, \ldots, X_n)$ be a random sample from a distribution $F$, and $\theta(F)$
a parameter of interest. Let $F_n$ denote the empirical distribution function based
on $S$, and suppose that $\theta(F_n)$ is to be used as an estimator of $\theta(F)$. Efron's
bootstrap method approximates the sampling distribution of a standardized version
of $\sqrt{n}(\theta(F_n) - \theta(F))$ by the resampling distribution of a corresponding statistic
$\sqrt{n}(\theta(\hat F_n) - \theta(F_n))$ based on a bootstrap sample $\hat S_n$, in which the original $F$ has been
replaced by the empirical distribution $F_n$ based on the original sample $S$, and $F_n$ of the
former statistic has been replaced by the empirical distribution $\hat F_n$ based on a bootstrap
sample. In Efron's bootstrap resampling scheme, $\hat S_n = (\hat X_1, \hat X_2, \ldots, \hat X_n)$ is a random
sample of size $n$ drawn from $S$ by simple random sampling with replacement
(SRSWR). In the Rao-Pathak-Koltchinskii (1997) sequential scheme, observations
are drawn from $S$ sequentially by SRSWR until there are $(m + 1) = [n(1 - e^{-1})] + 2$
distinct original observations in the bootstrap sample; the last observation is discarded
to ensure technical simplicity. Thus an observed bootstrap sample under the
Rao-Pathak-Koltchinskii scheme admits the form:

$$\hat S_N = (\hat X_1, \hat X_2, \ldots, \hat X_N) \tag{2.1}$$

in which $\hat X_1, \hat X_2, \ldots, \hat X_N$ have $m \approx n(1 - e^{-1})$ distinct observations from $S$. The
random sample size $N$ admits the following decomposition in terms of independent
random variables:

$$N = I_1 + I_2 + \cdots + I_m \tag{2.2}$$

in which $m = [n(1 - e^{-1})] + 1$; $I_1 = 1$, and for each $k$, $2 \le k \le m$,

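$$P(I_k = j) = \left(1 - \frac{k-1}{n}\right)\left(\frac{k-1}{n}\right)^{j-1}, \quad j = 1, 2, \ldots \tag{2.3}$$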
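The sequential scheme described above can be sketched in a few lines of code. This is only an illustration under stated assumptions: the function name `sequential_bootstrap`, the toy data and the seed are hypothetical and not taken from the paper; the stopping rule (draw until the $(m+1)$-th distinct unit appears, then discard that last draw) follows the description in this section.

```python
import math
import random

def sequential_bootstrap(sample, rng):
    """Draw with replacement until m + 1 distinct units appear; drop the last draw.

    Hypothetical helper for illustration; m = [n(1 - e^{-1})] + 1 as in the text.
    """
    n = len(sample)
    m = math.floor(n * (1 - math.exp(-1))) + 1
    seen = set()      # indices of distinct original units drawn so far
    indices = []      # the bootstrap sample, in drawing order
    while True:
        i = rng.randrange(n)
        if i not in seen and len(seen) == m:
            break     # this would be the (m + 1)-th distinct unit: discard it
        seen.add(i)
        indices.append(i)
    return [sample[i] for i in indices], m

rng = random.Random(0)          # fixed seed, chosen arbitrarily
data = list(range(100))         # toy data with distinct values
boot, m = sequential_bootstrap(data, rng)
```

The length of `boot` plays the role of the random sample size $N$, and the number of distinct values in it is exactly $m$.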

Although  we  have  established  the  consistency  of  this  sampling  scheme,  a  rigorous 
proof  of  its  second  order  correctness  requires  an  Edgeworth  expansion  for  dependent 
random  variables;  such  an  expansion  is  unavailable  in  the  literature  at  the  present 
time.  An  alternative  approach  that  can  be  used  is  to  slightly  modify  the  preceding 
resampling  scheme  so  that  existing  techniques  on  Edgeworth  expansion,  such  as  those 
of  Babu  and  Bai  (1996),  Bai  and  Rao  (1991, 1992),  Babu  and  Singh  (1989)  and  others, 
can  be  employed.  A  modification  of  our  previous  resampling  scheme  that  allows  the 
second-order  correctness  to  go  through  easily  is  as  follows. 

2.1. Poisson Bootstrap. For the selection of a bootstrap sample with a given number
$m$ of distinct units, under the Poisson Bootstrap, we provide a conceptual definition
and a practical approach. Let us take a sample $\alpha_1, \ldots, \alpha_n$ of $n$ independent observations
from $P(1)$, i.e., the Poisson distribution with mean 1. If there are exactly $m$ nonzero
values in the sample, we accept it and take

$$\hat S = \{(X_1, \alpha_1), (X_2, \alpha_2), \ldots, (X_n, \alpha_n)\}, \tag{2.4}$$

i.e., with the observation $X_i$ repeated $\alpha_i$ times, as the bootstrap sample. If the
number of nonzero values in $\alpha_1, \ldots, \alpha_n$ is not exactly $m$, we reject the entire sample
and draw another sample of size $n$. The bootstrap sample size $N$ of $\hat S$ as in (2.1) is
a random variable

$$N = \alpha_1 + \cdots + \alpha_n. \tag{2.5}$$

A practical way of implementing this resampling scheme is to first assign at random
$(n - m)$ $\alpha$'s a value of zero and to the remaining $m$ $\alpha$'s values independently chosen
from the Poisson distribution with mean $\mu = 1$ and censored at $X = 0$. An outline
of the equivalence of these two procedures is as follows.
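The two ways of drawing the Poisson weights can be placed side by side in code. The sketch below is illustrative only: the helper names (`poisson1`, `truncated_poisson1`, `poisson_weights_rejection`, `poisson_weights_direct`), the choice $n = 50$ and the seed are assumptions, and the Poisson draws use Knuth's multiplication method rather than any method named in the paper.

```python
import math
import random

def poisson1(rng):
    """One Poisson(1) draw via Knuth's multiplication method."""
    limit, k, prod = math.exp(-1.0), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def truncated_poisson1(rng):
    """Poisson(1) conditioned on being positive (zeros are redrawn)."""
    while True:
        y = poisson1(rng)
        if y > 0:
            return y

def poisson_weights_rejection(n, m, rng):
    """Conceptual definition: redraw all n weights until exactly m are nonzero."""
    while True:
        alpha = [poisson1(rng) for _ in range(n)]
        if sum(a > 0 for a in alpha) == m:
            return alpha

def poisson_weights_direct(n, m, rng):
    """Practical version: choose the m nonzero positions at random, then fill them."""
    alpha = [0] * n
    for i in rng.sample(range(n), m):
        alpha[i] = truncated_poisson1(rng)
    return alpha

rng = random.Random(1)                       # arbitrary seed
n = 50
m = math.floor(n * (1 - math.exp(-1))) + 1   # m = [n(1 - e^{-1})] + 1
a_rej = poisson_weights_rejection(n, m, rng)
a_dir = poisson_weights_direct(n, m, rng)
```

Both functions return a weight vector with exactly $m$ nonzero entries; the direct version avoids the rejection loop, which is the practical point made above.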

Theorem 2.1. The moment generating function $M_N(t)$ of $N$, the sample size of the
Poisson resampling scheme, is given by

$$M_N(t) = \left[\frac{e^{(e^t - 1)} - e^{-1}}{1 - e^{-1}}\right]^m. \tag{2.6}$$

Proof. Let $Y_1, Y_2, \ldots, Y_n$ be $n$ Poisson variables with mean $\mu = 1$. Then it is easily
seen that

$$P(N = w) = \text{const} \sum \frac{1}{\alpha_1!\,\alpha_2! \cdots \alpha_m!}, \tag{2.7}$$

where the sum extends over all positive natural numbers $\alpha_1, \alpha_2, \ldots, \alpha_m$ such that
$\alpha_1 + \cdots + \alpha_m = w$. It then follows that (see Pathak (1961))

$$P(N = w) = \text{const}\,\frac{1}{w!}\left[m^w - \binom{m}{1}(m-1)^w + \cdots \pm m\right]
= \text{const}\,\frac{1}{w!}\left(\Delta^m x^w\big|_{x=0}\right), \tag{2.8}$$

where $\Delta$ is the difference operator with unit increment.
From (2.8) it follows that

$$P(N = w) = \frac{\Delta^m x^w\big|_{x=0}}{w!\,(e - 1)^m}. \tag{2.9}$$

Consequently the moment generating function $M_N(t)$ of $N$ is given by

$$M_N(t) = \sum_{w \ge 0} \frac{e^{tw}\,\Delta^m x^w\big|_{x=0}}{w!\,(e - 1)^m}
= \frac{\Delta^m\left(e^{x e^t}\right)\big|_{x=0}}{(e - 1)^m}
= \frac{1}{(e - 1)^m}\left\{e^{m e^t} - \binom{m}{1} e^{(m-1)e^t} + \binom{m}{2} e^{(m-2)e^t} - \cdots\right\} \tag{2.10}$$

$$= \left[\frac{e^{e^t} - 1}{e - 1}\right]^m
= \left[\frac{e^{(e^t - 1)} - e^{-1}}{1 - e^{-1}}\right]^m. \tag{2.11}$$

This completes the proof.

The preceding theorem shows that the distribution of $N$ can be viewed as that of
the sum of $m$ IID random variables with a common distribution with the moment
generating function given by the formula

$$m(t) = \frac{e^{(e^t - 1)} - e^{-1}}{1 - e^{-1}}. \tag{2.12}$$

It is evident that $m(t)$ is the moment generating function of the Poisson distribution
with $\mu = 1$ and censored at $X = 0$. Let $Y$ denote a random variable with moment
generating function $m(t)$. Then $E(Y) = 1/(1 - e^{-1})$ and $V(Y) = e(e - 2)/(e - 1)^2$.
Therefore

$$E(N) = mE(Y) = n + O(1) \tag{2.13}$$

and

$$V(N) = mV(Y) = n(e - 2)/(e - 1) + O(1). \tag{2.14}$$
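The moment formulas in (2.13) and (2.14) can be checked numerically by summing the probability mass function of the Poisson distribution with mean 1 conditioned on being positive. The snippet below is a small sanity check, not part of the original argument; the helper name `truncated_poisson_pmf`, the truncation of the series at 60 terms and the choice $n = 1000$ are assumptions made for illustration.

```python
import math

def truncated_poisson_pmf(k):
    """P(Y = k), k = 1, 2, ..., for a Poisson(1) variable conditioned on Y > 0."""
    return math.exp(-1.0) / (math.factorial(k) * (1.0 - math.exp(-1.0)))

ks = range(1, 60)   # the tail beyond 60 is negligible in double precision
mean_y = sum(k * truncated_poisson_pmf(k) for k in ks)
var_y = sum(k * k * truncated_poisson_pmf(k) for k in ks) - mean_y ** 2

e = math.e
n = 1000
m = math.floor(n * (1 - 1 / e)) + 1      # m = [n(1 - e^{-1})] + 1
mean_n = m * mean_y                      # E(N) = m E(Y)
var_n = m * var_y                        # V(N) = m V(Y)
```

With these values, `mean_y` agrees with $1/(1 - e^{-1})$, `var_y` agrees with $e(e-2)/(e-1)^2$, and `mean_n`, `var_n` differ from $n$ and $n(e-2)/(e-1)$ only by bounded amounts, as the $O(1)$ terms indicate.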

2.2. Advantage of sequential bootstrap over regular bootstrap. One of the main advantages
of the sequential bootstrap over the classical fixed sample size bootstrap is to
avoid situations where a bootstrap sample has several repeated observations which
may give rise to a degenerate value of the statistic under consideration.

Since the bootstrap utilizes all the data points, in general, the bootstrap estimator
of variance of a statistic is not robust even for robust statistics. The bootstrap
estimator of variance of the sample median $m_n$ based on the sample $X_1, \ldots, X_n$ is
given by

$$V_n = \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \left(X_{(i)} - X_{(j)}\right)^2 p_{i,n}\, p_{j,n},$$

where

$$X_{(1)} \le \cdots \le X_{(n)}$$

is the ordering of the data,

$$p_{i,n} = n \binom{n-1}{r-1} \int_{(i-1)/n}^{i/n} u^{r-1} (1 - u)^{n-r}\, du,$$

and $r = n/2$ if $n$ is an even integer and $r = (n+1)/2$ if it is an odd integer. Breakdown
point is a widely used measure of robustness in modern statistical literature. The
breakdown point of a statistic $\theta_n = \theta_n(X_1, \ldots, X_n)$ is defined as $(k/n)$, where $k$ is
the minimum number of data points needed to be replaced by worst possible outliers
to move the statistic beyond any bound. In the case of bootstrap variance of sample
median, the breakdown point is $(1/n)$.
However, in the case of sequential bootstrap or Poisson bootstrap, $X_{(i)}$ for $i <
m - (n/2)$ do not enter the estimate of the variance of the sample median. Hence the
breakdown point is $\frac{1}{2} - e^{-1} = 0.132$.
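The fragility of the classical bootstrap variance of the median can be seen in a small computation. The sketch below is an illustration under stated assumptions: it computes the resampling probabilities of the order statistics through binomial tail probabilities, using the fact that the bootstrap median is at most $X_{(i)}$ exactly when at least $r$ of the $n$ draws fall among the $i$ smallest values (no ties assumed). The function names and the toy data are hypothetical.

```python
import math

def binom_tail(n, p, r):
    """P(Bin(n, p) >= r)."""
    return sum(math.comb(n, k) * p**k * (1 - p)**(n - k) for k in range(r, n + 1))

def median_weights(n):
    """P(bootstrap median = X_(i)), i = 1..n, under Efron's bootstrap (no ties)."""
    r = n // 2 if n % 2 == 0 else (n + 1) // 2
    return [binom_tail(n, i / n, r) - binom_tail(n, (i - 1) / n, r)
            for i in range(1, n + 1)]

def bootstrap_median_variance(data):
    """Exact bootstrap variance of the sample median from the weights above."""
    xs = sorted(data)
    w = median_weights(len(xs))
    mean = sum(wi * x for wi, x in zip(w, xs))
    return sum(wi * (x - mean) ** 2 for wi, x in zip(w, xs))

clean = [float(v) for v in range(1, 12)]   # toy sample, n = 11
dirty = clean[:-1] + [1e6]                 # a single gross outlier
w = median_weights(len(clean))
var_clean = bootstrap_median_variance(clean)
var_dirty = bootstrap_median_variance(dirty)
```

Because the largest order statistic carries a small but positive weight, replacing one observation by an arbitrarily large value inflates `var_dirty` without bound, which is what a breakdown point of $1/n$ expresses.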

With  these  results,  we  now  proceed  to  establish  the  second  order  correctness  of  the 
Poisson  Bootstrap,  a  sequential  bootstrap  based  on  the  Poisson  distribution. 

3.  Preliminary  Results 

To  establish  Proposition  1  of  Section  4,  we  require  some  preliminary  results.  First 
note  that  by  Theorem  10.1  of  Bhattacharya  and  Ranga  Rao  (1986),  there  exists  a 
random  variable  V  with  distribution  J  such  that  E(V10)  <  oo,  P(\V\  >  1)  <  -,  and 
the characteristic function $\hat J$ of $J$ vanishes outside the interval $[-c, c]$, for some $c > 0$,
i.e.

$$\hat J(t) = 0 \text{ for } |t| > c. \tag{3.1}$$

For any $\epsilon > 0$, let $J_\epsilon$ denote the distribution of $\epsilon V$. The next lemma is a trivial
consequence of Lemma 24.1 of Bhattacharya and Ranga Rao (1986). The inequality
is similar to (4.1) of Babu and Singh (1984). See also (4.2) of Babu and Singh (1984).

Lemma 1. Suppose $h$ is a Borel measurable function on the real line $R$, bounded by
1. Let $\mu$ be a finite measure and $\nu$ a finite signed measure. Then for any $0 < \epsilon < 1$,

|  J  h(y)(p,  —  v)(dy)\  <  15^|(/i  —  u)*  J£|(R)  +  (/i(R)  +  ^+(R))P(|V|  >  e-1/2) 

+3-£,/V(R)  +  is+(R)) 

+  sup  /  u(h,  2e,  z  +  y)u+  (dy) ) , 

|z|<«*/4  J  J 

where $\nu = \nu^+ - \nu^-$ is the Jordan decomposition, $|\nu| = \nu^+ + \nu^-$ is the total variation
measure and
Further we have for any $0 < v < 1$, $0 < \epsilon < 1$,

j u j(h,  e,v-  y)(f>(y)dy  <3  J  u(h,  e,  y)<j){y)dy 

+0(f-1  exp(— ^v-2)).  (3.2) 

The inequality (3.2) is the relation (3.5) of Sweeting (1977).

The next lemma helps in estimating the total variation norm of $(\mu - \nu) * J_\epsilon$. Let
$D^k h(t, s, \omega)$ denote the $k$-th partial derivative of $h$ with respect to $t$.

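Lemma 2. Let $g$ be a real valued function on $R \times Z^2$ satisfying

$$\sum_{r=-\infty}^{\infty} \sum_{j=-\infty}^{\infty} \int (1 + x^2)\,|g(x, r, j)|\,dx < \infty.$$

Then for all integers $r$ and $j$,

$$\int_{-\infty}^{\infty} |g(x, r, j)|\,dx \le \max_{k=0,2} \int_{-\infty}^{\infty} \int_G |D^k \hat g(t, s, \omega)|\,ds\,d\omega\,dt,$$

where $G = [-\pi, \pi] \times [-\pi, \pi]$ and $\hat g$ denotes the Fourier transform of $g$, that is

$$\hat g(t, s, \omega) = \sum_{r=-\infty}^{\infty} \sum_{j=-\infty}^{\infty} e^{isr + i\omega j} \int e^{itx} g(x, r, j)\,dx.$$

Proof. Following the proof of Lemma 11.6 of Bhattacharya and Ranga Rao (1986),
let $A = \{x : g(x, r, j) > 0\}$. If for each $r, j$,

$$\hat g(t; r, j) = \int_{-\infty}^{\infty} e^{itx} g(x, r, j)\,dx,$$

then the Fourier transform $\hat h_{r,j}$ of the function

$$h_{r,j}(x) = (1 + x^2)\,g(x, r, j)$$

is given by

$$\hat h_{r,j}(t) = \hat g(t; r, j) - D^2 \hat g(t; r, j).$$

Thus by Fourier inversion and Fubini's Theorem we have

$$(2\pi)^{-3} \int_{-\infty}^{\infty} \int_{-\pi}^{\pi} \int_{-\pi}^{\pi} \left(\hat g(t, s, \omega) - D^2 \hat g(t, s, \omega)\right) e^{-i(sr + \omega j + tx)}\,ds\,d\omega\,dt = (1 + x^2)\,g(x, r, j).$$

Hence we have

$$\int_{-\infty}^{\infty} |g(x, r, j)|\,dx = \int_A g(x, r, j)\,dx - \int_{R - A} g(x, r, j)\,dx$$

$$= \left(\int_A - \int_{R - A}\right) (1 + x^2)^{-1} (1 + x^2)\,g(x, r, j)\,dx$$

$$= (2\pi)^{-3} \left(\int_A - \int_{R - A}\right) (1 + x^2)^{-1} \left(\int_{-\infty}^{\infty} \int_{-\pi}^{\pi} \int_{-\pi}^{\pi} \left(\hat g(t, s, \omega) - D^2 \hat g(t, s, \omega)\right) e^{-i(sr + \omega j + tx)}\,ds\,d\omega\,dt\right) dx$$

$$\le (2\pi)^{-3} \left(\int_{-\infty}^{\infty} (1 + x^2)^{-1}\,dx\right) \int_{-\infty}^{\infty} \int_G \left(|\hat g(t, s, \omega)| + |D^2 \hat g(t, s, \omega)|\right) ds\,d\omega\,dt$$

$$\le \max_{k=0,2} \int_{-\infty}^{\infty} \int_G |D^k \hat g(t, s, \omega)|\,ds\,d\omega\,dt.$$

This completes the proof of the Lemma. □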


The next lemma is Lemma 2 of Babu and Singh (1984) and is stated here for ready
reference.

Lemma 3. Suppose $X_1$ satisfies Cramér's condition (4.1). For any $K > 0$, there
exists a $0 < \gamma = \gamma(K) < 1$, and $\eta > 0$ such that for almost all sample sequences
$\{X_j\}$,

$$\limsup_{n \to \infty}\ \sup_{K \le |t| \le e^{\eta n}} \left|\frac{1}{n} \sum_{j=1}^{n} e^{it(X_j - \bar X_n)}\right| \le \gamma.$$


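The next three lemmas are similar to Lemmas 2 and 3 of Babu and Bai. To state
these let $c_j$ satisfy conditions of Proposition 1. Let $d_j = d_{j,n} = c_j n^{-1/2}$. Recall that
for some $M' > 1$

$$\sum_{j=1}^{n} d_j = 0, \quad \sum_{j=1}^{n} d_j^2 = 1 \quad \text{and} \quad \sum_{j=1}^{n} |d_j|^3 \le (M'/\sqrt{n}).$$

Recall that $f_n$ denotes the characteristic function of $\sum_{j=1}^{n} (d_j Y_j, Y_j, \delta_j)$. For each fixed
$r$ and $m$, the Fourier transform of

$$P\left(\sum_{j=1}^{n} d_j Y_j \le x,\ \sum_{j=1}^{n} Y_j = r,\ \sum_{j=1}^{n} \delta_j = m\right)$$

is given by

$$\hat F_n(t; r, m) = E\left(e^{it \sum_{j=1}^{n} d_j Y_j}\, I\Big\{\sum_{j=1}^{n} Y_j = r,\ \sum_{j=1}^{n} \delta_j = m\Big\}\right)
= \frac{1}{4\pi^2} \int_{-\pi}^{\pi} \int_{-\pi}^{\pi} f_n(t, s, \omega)\, e^{-isr - i\omega m}\,ds\,d\omega, \tag{3.3}$$

where $f_n$ is the characteristic function of $\sum_{j=1}^{n} (d_j Y_j, Y_j, \delta_j)$. Note that

$$f_n(t, s, \omega) = \prod_{j=1}^{n} f(t d_j + s, \omega), \tag{3.4}$$

where

$$f(u, \omega) = E\left(e^{iuY_1 + i\omega\delta_1}\right) = e^{-1}\left(1 - e^{i\omega} + e^{i\omega + e^{iu}}\right). \tag{3.5}$$

Let for any subset $R$ of $\{1, \ldots, n\}$,

$$f_{n,R}(t, s, \omega) = \prod_{j \in R} f(t d_j + s, \omega)$$

and for any $0 \le k \le 2$, let

$$f_{n,k}(t, s, \omega) = \max\{|f_{n,R}(t, s, \omega)|\},$$

where the maximum is taken over all subsets $R$ of $\{1, \ldots, n\}$ of size $n - k$. Finally
let

$$d_n(t) = \left|\frac{1}{n} \sum_{j=1}^{n} e^{itd_j}\right|. \tag{3.6}$$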

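Lemma 4. For any $0 \le k \le 2$,

$$f_{n,k}(t, s, \omega) \le \exp\left(1 - 0.2\,n\,(1 - d_n(t))\right). \tag{3.7}$$

Proof. Observe that

$$\frac{1}{n} \sum_{j=1}^{n} |f(t d_j + s, \omega)|^2 = \frac{1}{n} \sum_{j=1}^{n} E\left(e^{i(t d_j + s)(Y_1 - Y_2) + i\omega(\delta_1 - \delta_2)}\right)$$

$$= E\left(\Big(\frac{1}{n} \sum_{j=1}^{n} e^{it(Y_1 - Y_2) d_j}\Big)\, e^{is(Y_1 - Y_2) + i\omega(\delta_1 - \delta_2)}\right)$$

$$\le E\left(d_n(t(Y_1 - Y_2))\right)$$

$$\le P(|Y_1 - Y_2| \ne 1) + d_n(t)\,P(|Y_1 - Y_2| = 1)$$

$$\le 1 - (1 - d_n(t))\,P(|Y_1 - Y_2| = 1)$$

$$\le 1 - (1 - d_n(t))\,0.4.$$

As $x \le e^{x-1}$ for all $0 \le x \le 1$, for any subset $R \subset \{1, \ldots, n\}$ with at least $n - 2$
integers, we have

$$(f_{n,R}(t, s, \omega))^2 = \prod_{j \in R} |f(t d_j + s, \omega)|^2$$

$$\le \exp\Big(2 - n + \sum_{j \in R} |f(t d_j + s, \omega)|^2\Big)$$

$$\le \exp\Big(2 - n + \sum_{j=1}^{n} |f(t d_j + s, \omega)|^2\Big)$$

$$\le \exp\left(2 - 0.4\,n\,(1 - d_n(t))\right).$$

This completes the proof.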
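The numerical constant in Lemma 4 rests on a lower bound of 0.4 for the probability that two independent Poisson variables with mean 1 differ by exactly one. A short computation, given here only as a sanity check with a hypothetical helper name and a series truncated at 50 terms, confirms that this probability is about 0.43.

```python
import math

def poisson1_pmf(k):
    """P(Y = k) for a Poisson variable with mean 1."""
    return math.exp(-1.0) / math.factorial(k)

# P(|Y1 - Y2| = 1) = 2 * sum_k P(Y1 = k) * P(Y2 = k + 1), by symmetry
prob_diff_one = 2 * sum(poisson1_pmf(k) * poisson1_pmf(k + 1) for k in range(50))
```

Since `prob_diff_one` exceeds 0.4, the factor 0.4 in the proof, and hence the factor 0.2 in (3.7), is justified with a little room to spare.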

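Lemma 5. For $|t| \le \sqrt{n}/M'$, we have

$$d_n(t) = \left|\frac{1}{n} \sum_{j=1}^{n} e^{itd_j}\right| \le 1 - \frac{t^2}{3n}, \tag{3.8}$$

and for $0 \le k \le 2$, $|s| \le \pi$, $|\omega| \le \pi$, $|t| \le \sqrt{n}/M'$,

$$f_{n,k}(t, s, \omega) \le \exp\left(1 - \tfrac{1}{15} t^2\right). \tag{3.9}$$

Proof. For $|t| \le \sqrt{n}/M'$,

$$d_n(t) = \left|1 - \frac{t^2}{2n} + \frac{1}{n} \sum_{j=1}^{n} \left(e^{itd_j} - 1 - itd_j + \frac{t^2}{2} d_j^2\right)\right|$$

$$\le 1 - \frac{t^2}{2n} + \frac{|t|^3}{6n} \sum_{j=1}^{n} |d_j|^3$$

$$\le 1 - \frac{t^2}{3n}.$$

The second part follows from (3.7). □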


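Lemma 6. Suppose $|t| \le \log n$, $|s| \le \pi$, $|\omega| \le \pi$ and $s^2 + \omega^2 \ge (\log n)^2 n^{-1}$. Then
there exist constants $k_1$ and $k_2$ depending only on $M'$ such that for $0 \le k \le 2$,

$$f_{n,k}(t, s, \omega) \le k_1 \exp(-k_2 (\log n)^2). \tag{3.10}$$

Proof. As $E(Y_1^3) = 5$,

$$\left|e^{-iu - ivp} E\left(e^{iuY_1 + iv\delta_1}\right) - 1 + \tfrac{1}{2}(u, v)\Sigma_0 (u, v)'\right| \le (u^2 + v^2)^{3/2}.$$

Since

$$(u, v)\Sigma_0 (u, v)' \ge q\left(\tfrac{1}{2} - q\right)(u^2 + v^2),$$

there exists a positive $\eta < \frac{q}{16}\left(\frac{1}{2} - q\right)$ such that for all $|u| \le 4\eta$, $|v| \le 4\eta$, we have

$$\left|E\left(e^{iuY_1 + iv\delta_1}\right)\right| \le 1 - \tfrac{1}{2}(u, v)\Sigma_0 (u, v)' + (u^2 + v^2)^{3/2}$$

$$\le 1 - \tfrac{q}{2}\left(\tfrac{1}{2} - q\right)(u^2 + v^2) + (u^2 + v^2)^{3/2}$$

$$\le 1 - \left(\tfrac{q}{2}\left(\tfrac{1}{2} - q\right) - 4\sqrt{2}\,\eta\right)(u^2 + v^2)$$

$$\le 1 - \eta(u^2 + v^2)$$

$$\le e^{-\eta(u^2 + v^2)}. \tag{3.11}$$

If $|t| \le \log n$ then $|t d_j| \le (M')^{1/3} n^{-1/6} \log n \le \eta$ for all $n \ge n(M', \eta)$. In this case,
if $|s| \le 2\eta$, $|\omega| \le 2\eta$, then $|s + t d_j| \le 3\eta$ for all $n \ge n(M', \eta)$. So by (3.11),

$$f_{n,k}(t, s, \omega) \le 4\,|f_n(t, s, \omega)|$$

$$\le 4 e^{-\eta n[s^2 + \omega^2] - \eta t^2}$$

$$\le 4 e^{-\eta n[s^2 + \omega^2]}$$

$$\le 4 e^{-\eta (\log n)^2}$$

provided $s^2 + \omega^2 \ge (\log n)^2 n^{-1}$ and $|s| \le 2\eta$, $|\omega| \le 2\eta$.

To establish inequality (3.10) for the remaining $s$ and $\omega$, note that $b$ given by

$$b(u) = E\left(e^{iuY_1} \mid Y_1 > 0\right)$$

is the characteristic function of a lattice distribution of span 1. So for $0 < \eta < \pi/10$,

$$\left|E\left(e^{iuY_1 + iv\delta_1}\right)\right| = \left|q + p e^{iv} b(u)\right| \le q + p|b(u)| \le 1 - \gamma$$

for some $0 < \gamma < 1$, whenever $\eta \le |u| \le \pi + \eta$. Hence for $|t| \le \log n$, $2\eta \le |s| \le \pi$,
all $\omega$ and for all large $n$,

$$|f(t d_j + s, \omega)| \le 1 - \gamma.$$

Now consider the case $|s| \le 2\eta$ and $|v| \ge 2\eta$. Let $0 < \theta < \eta$ be such that $|1 - b(u)| \le
\eta^2 q$ whenever $|u| \le \theta$. Hence as $0 < \eta < 1$,

$$\left|E\left(e^{iuY_1 + iv\delta_1}\right)\right| = \left|q + p e^{iv} b(u)\right|$$

$$\le |q + p e^{iv}| + p\,|1 - b(u)|$$

$$\le |q + p e^{iv}| + \eta^2 pq$$

$$= (1 - 2pq(1 - \cos v))^{1/2} + \eta^2 pq$$

$$\le (1 - 2pq(1 - \cos 2\eta))^{1/2} + \eta^2 pq$$

$$\le 1 + pq(\cos 2\eta - 1 + \eta^2)$$

$$\le 1 - \tfrac{1}{2}\eta^2 pq \le 1 - \tfrac{1}{2}\theta^2 pq.$$

Thus if $|t| \le \log n$ and $|s| \le \theta/2$ then $|t d_j + s| \le \theta$ for all large $n$. So if $|\omega| \ge 2\eta$,
$0 \le k \le 2$, then

$$f_{n,k}(t, s, \omega) \le k_3 \rho^n$$

for some $k_3 > 0$ and $0 < \rho < 1$. This completes the proof of the lemma. □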

4.  Second  Order  Correction 

Let $\{X_1, \ldots, X_n\}$ be i.i.d. random variables with mean $\mu$ and variance $\sigma^2$. Suppose
$X_1$ satisfies Cramér's condition

$$\limsup_{|t| \to \infty} \left|E\left(e^{itX_1}\right)\right| < 1. \tag{4.1}$$

Let $\bar X_n = \frac{1}{n} \sum_{i=1}^{n} X_i$, $s_n^2 = \frac{1}{n} \sum_{i=1}^{n} (X_i - \bar X_n)^2$ and $c_j = c_{j,n} = (X_j - \bar X_n)/s_n$. Let
$Y_1, Y_2, \ldots$ be a sequence of i.i.d. Poisson random variables with mean 1. Put $\delta_i = 1$
if $Y_i > 0$ and $= 0$ otherwise. Let $N = \sum_{i=1}^{n} Y_i$, $T_n = \sum_{i=1}^{n} \delta_i$, $q = e^{-1} = P(Y_1 = 0)$ and
$p = 1 - e^{-1}$. We shall obtain Edgeworth expansions for $\frac{1}{\sqrt{N}} \sum_{j=1}^{n} c_j Y_j$ given $T_n = m$
and $X_1, \ldots, X_n$. We then use this result to establish second order correctness for the
Poisson Bootstrap in the simple case of $\sqrt{n}(\bar X - \mu)/\sigma$. This result can be extended
to statistics which can be represented as a smooth function of a multivariate mean.
Second order correctness of the Poisson Bootstrap for such models is discussed in Section
5.

We  now  state  the  main  theorem. 


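Theorem 4.1. Let $E|X_1|^5 < \infty$ and suppose the characteristic function of $X_1$ satisfies Cramér's
condition (4.1). If $m - np$ is bounded, then

$$P\Big(\frac{1}{\sqrt{N}} \sum_{i=1}^{n} (X_i - \bar X_n) Y_i \le x s_n \,\Big|\, T_n = m;\ X_1, \ldots, X_n\Big)
- P\Big(\frac{1}{\sqrt{n}} \sum_{i=1}^{n} (X_i - E(X_i)) \le x\sigma\Big) = O_p(n^{-1}), \tag{4.2}$$

uniformly in $x$.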

Remark  1.  With  truncation  and  additional  analysis,  the  moment  condition  on  X\  can 
be  relaxed. 

To  prove  Theorem  4.1  we  first  establish  some  notation  and  a  preliminary  proposi¬ 
tion. 

Let $\phi$ denote the standard normal density, $\phi_0$ denote the bivariate normal density
with zero mean vector and dispersion matrix

$$\Sigma_0 = \begin{pmatrix} 1 & q \\ q & pq \end{pmatrix},$$

where $q = e^{-1}$ and $p = 1 - q$. Define

pn{x)  =  - 

<W*.  »,<•>)  =  ftW  + 

=  i  (E((Z'Ti\y,u)')(Y  -  Ifpq  +  (q-  p)6)/(q(p  -  q))\ 

+  jtr2  -  1  )e((Y  -  1)2Z'E0->(!,,uA 

where $Y$ is a Poisson random variable with mean 1, $\delta = I_{\{Y > 0\}}$ and $Z = (Y - 1, \delta - p)'$.
For brevity fix $X_1, \ldots, X_n$ and define

$$y_r = (r - n)\,n^{-1/2}, \quad \omega_m = (m - np)\,n^{-1/2},$$

and

$$F_n(x, r, m) = P\Big(\sum_{i=1}^{n} c_i Y_i \le x\sqrt{n},\ N = r,\ T_n = m\Big).$$

Note that $\sum_{i=1}^{n} c_i = 0$ and $\sum_{i=1}^{n} c_i^2 = n$.

Proposition 1. Suppose for any $K > 0$, there exists a $0 < \gamma < 1$ such that

$$\limsup_{n \to \infty}\ \sup_{K \le |t| \le n^3} \left|\frac{1}{n} \sum_{j=1}^{n} e^{itc_j}\right| \le \gamma.$$

Suppose for some $M > 1$, $\sum_{j=1}^{n} |c_j|^5 \le nM$. Then uniformly in $x$ and $r$, we have

$$n P\Big(\sum_{i=1}^{n} c_i Y_i \le x\sqrt{n},\ N = r,\ T_n = m\Big)
= \int_{-\infty}^{x} \psi_n(v, y_r, \omega_m)\,dv + O(n^{-3/2}), \tag{4.3}$$

where

$$\psi_n(v, y, \omega) = \phi(v)\,\phi_0(y, \omega)\Big(1 + \frac{1}{\sqrt{n}} Q_n(v, y, \omega) + \frac{1}{n} Q_{n1}(v, y, \omega)\Big), \tag{4.4}$$

and $Q_{n1}$ is a fourth degree polynomial in $v, y, \omega$ whose coefficients are bounded by a constant
depending only on $M$.

We use the lemmas proved in Section 3 in proving the proposition.

Proof of Proposition 1. Since for any $\epsilon > 0$,

$$\int_{x - \epsilon}^{x + \epsilon} \phi(v)\,dv = O(\epsilon)$$

uniformly in $x$, we have by Lemmas 1 and 2, with $\epsilon = n^{-2}$,

$$\Big|nF_n(x, r, m) - \int_{-\infty}^{x} \psi_n(v, y_r, \omega_m)\,dv\Big|
\le n^{-2} + \max_{0 \le j \le 2} \int_{-\pi}^{\pi} \int_{-\pi}^{\pi} \int_{-cn^{5/2}}^{cn^{5/2}}
\Big|D^j\Big(n f_n(t, s, \omega)\,e^{-isr - i\omega m} - \hat\psi_n(t, s, \omega)\Big)\Big|\,dt\,ds\,d\omega, \tag{4.5}$$

where $\hat\psi_n$ and $f_n$ denote the characteristic functions of $\psi_n$ and $\sum_{j=1}^{n} \big(\tfrac{1}{\sqrt{n}} c_j Y_j, Y_j, \delta_j\big)$.
Note that for each $r$ and $m$, the Fourier transform of $F_n(\cdot, r, m)$ is given by

$$\hat F_n(t, r, m) = \frac{1}{4\pi^2} \int_{-\pi}^{\pi} \int_{-\pi}^{\pi} f_n(t, s, \omega)\,e^{-isr - i\omega m}\,ds\,d\omega. \tag{4.6}$$

To estimate the last integral in (4.5), we divide the range of integration into
4 possibly overlapping regions:

(i) $|t| \le \log n$, $|s| \le (\log n)n^{-1/2}$, $|\omega| \le (\log n)n^{-1/2}$

(ii) $|t| \le \log n$, $\omega^2 + s^2 \ge (\log n)^2 n^{-1}$, $|\omega| \le \pi$, $|s| \le \pi$

(iii) $\log n \le |t| \le \sqrt{n}/M'$, $|s| \le \pi$ and $|\omega| \le \pi$

(iv) $\sqrt{n}/M' \le |t| \le cn^{5/2}$, $|s| \le \pi$ and $|\omega| \le \pi$,

where $c$ is the constant in (3.1).

We expand $nD^j f_n(t, s, \omega)$ in region (i) and estimate the integral in (4.5) as in the
proof of Theorem 9.9 of Bhattacharya and Ranga Rao (1986). Lemma 6 is used for
region (ii), Lemma 5 is used for region (iii) and Lemmas 3 and 4 are used for region
(iv) to estimate the integral. These estimates lead to a bound of $O(n^{-3/2})$ for (4.5),
completing the proof.

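Proof of Theorem 4.1. Using Proposition 1, we shall first show that uniformly in
$x$,

$$P\Big(\frac{1}{\sqrt{N}} \sum_{i=1}^{n} (X_i - \bar X_n) Y_i \le x s_n \,\Big|\, T_n = m,\ X_1, \ldots, X_n\Big)
= \Phi(x) + \frac{1}{6\sqrt{n}}\,\gamma_3 (1 - x^2)\,\phi(x) + O_p\Big(\frac{1}{n}\Big), \tag{4.7}$$

where $\Phi$ is the standard normal distribution function and

$$\gamma_3 = \sigma^{-3} E(X_1 - \mu)^3.$$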

Let 

MI^£(5H-,og4 

Then  by  a  moderate  deviation  result 

l-P(Hn)=0(n~10). 

Suppose $\sum_{j=1}^{n} |X_j - \bar X_n|^5 \le nM s_n^5$ for some $M > 1$, then by Proposition 1,

£(*,  -  X.)Yi  <xs„,T„  =  m 


(4.8) 
=  £  ^p(-^YiCjYj<x,N  =  r,T„  =  m)+  0(n<>) 

|r“n|<v/nlogn  Wr  j— i  / 

j  _  rxyjrjn 

=  ~fo  X!  /  Mv,yr,Um)dv  +  Q{n-1) 

lr-nl<\/rclogn  ~°° 

=  7T  E  /_  *•  («p  +  ^>‘/2'  *•  <*•)  (i  +  7  *•  +  ot"'1) 

|r-n|<  ^/n  logn  N  v  /  \  v  / 

= /»(/, i<bg„*n(“(1+^)1/2’y’Wm)(1+^) /d^“+0(n^  (49) 

By Theorem 13 on local Edgeworth expansion on pages 205-206 of Petrov (1975), we
have

$$\sqrt{npq}\,P(T_n = m) = \phi(\omega_m) + \sum_{j=1}^{2} n^{-j/2} q_j(\omega_m)\,\phi(\omega_m) + O(n^{-1})$$

$$= \phi(\omega_m) + n^{-1/2} q_1(\omega_m)\,\phi(\omega_m) + O(n^{-1}), \tag{4.10}$$

where $q_1$ and $q_2$ are polynomials and

$$q_1(y) = \tfrac{1}{6} E(\delta_1 - p)^3 (pq)^{-3/2} (y^3 - 3y).$$

The estimate (4.7) follows from (4.8), (4.9) and (4.10) after some simple algebra. The
Theorem now follows from the standard Edgeworth expansion

$$P\big(\sqrt{n}(\bar X_n - \mu) \le \sigma x\big) = \Phi(x) + \frac{1}{6\sqrt{n}}\,\gamma_3 (1 - x^2)\,\phi(x) + O(n^{-1}),$$

and

^  |^»  —  ^n|5  >  nMs^j  =  0^1^  for  some  M  >  1. 

This  completes  the  proof. 
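The role of the one-term correction in the standard Edgeworth expansion can be illustrated numerically. The example below is only a sketch: the choice of a unit exponential population (for which $\mu = \sigma = 1$ and $\gamma_3 = 2$), the sample size $n = 25$, the evaluation point $x = 0$ and all function names are assumptions made for the illustration, not taken from the paper. The exact distribution of the standardized mean is obtained from the Gamma distribution of the sum.

```python
import math

def std_normal_pdf(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def std_normal_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def one_term_edgeworth(x, n, gamma3):
    """Phi(x) + gamma3 (1 - x^2) phi(x) / (6 sqrt(n))."""
    return std_normal_cdf(x) + gamma3 * (1.0 - x * x) * std_normal_pdf(x) / (6.0 * math.sqrt(n))

def exact_cdf_exp_mean(x, n):
    """P(sqrt(n) (Xbar - 1) <= x) for Exp(1) data, using the Gamma(n, 1) sum."""
    s = n + x * math.sqrt(n)          # threshold for X_1 + ... + X_n
    if s <= 0:
        return 0.0
    # P(Gamma(n, 1) <= s) = 1 - sum_{k < n} e^{-s} s^k / k!  (integer n)
    return 1.0 - sum(math.exp(-s) * s**k / math.factorial(k) for k in range(n))

n, x = 25, 0.0
exact = exact_cdf_exp_mean(x, n)
clt = std_normal_cdf(x)                        # plain normal approximation
edge = one_term_edgeworth(x, n, gamma3=2.0)    # with the skewness correction
```

In this setting the normal approximation `clt` misses `exact` by roughly 0.027, while the corrected value `edge` is far closer, which is the sense in which a second-order correct method improves on the central limit theorem.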
5. Smooth Functional Model

Proposition 1 and Theorem 4.1 can be extended to the multivariate case and to statistics
which can be expressed as smooth functions of multivariate means. Let $X_1, \ldots, X_n$
be a sequence of i.i.d. random vectors with mean $\mu$ and dispersion $\Sigma$. Let $\Sigma_n$ denote
the sample dispersion matrix of $X_1, \ldots, X_n$. With some additional effort and using
the ideas and lemmas from Bai and Rao (1991) and Babu and Bai (1996), we can
establish the following results.

Theorem 5.1. Suppose the characteristic function of $X_1$ satisfies Cramér's condition
and $E\|X_1\|^4 < \infty$. Let $H$ be a 3-times continuously differentiable function in a
neighborhood of $\mu$. Let $\ell(y)$ denote the vector of first order partial derivatives at $y$,
and $\ell(\mu) \ne 0$. If $m - n(1 - e^{-1})$ is bounded, then for almost all sample sequences
$X_1, X_2, \ldots$, we have


sup  y/n 

Z 


xw-mxn))  ^ 

V  y/e(xn)Y*e(xn)  "  *  T"  ~ 

-p(y^(H{Xn)  -  H(rij  <  zy/eMTXM'j 


as  n  — *  oo. 


m;X j, 
-0, 


The next two results are more suitable for applications to studentized statistics.

Theorem 5.2. Let $\{X_n\}$ be as in Theorem 5.1. Suppose the function $H$ is three
times continuously differentiable in a neighborhood of the origin and $H(0) = 0$. If
$m - n(1 - e^{-1})$ is bounded, then for almost all sample sequences $X_1, X_2, \ldots$, we have


sup  y/n\ 


p{ 


s/mWto) 

-p(y/nH(Xn)-li<zy/F( 0)Ee(d)) 


Tn  —  m\  X\, . . .  ,  Xn 


0, 
It is easily seen that the second order correctness of the sequential bootstrap pivot
such as


tt*n  = 


/Sn 


given that $T_n := \sum_{i=1}^{n} I_{\{Y_i > 0\}} = m$ follows from Theorem 5.1. The one-term
correction captures the skewness of the underlying distribution.

As  the  most  commonly  used  statistics,  especially  the  studentized  versions,  are  of 
the  type 


t„  =  (J? )  -  H(V) ) /{ (i p A(X,)) ,  (5.1) 

where  A  is  a  function  on  R*  —*  Rr  and  £  is  a  smooth  real-valued  function  on  Rp.  The 
classical  Student’s  t  is  an  example  of  this  type  of  statistic.  If  Xi  are  univariate,  then 


tn  = 


y/n(Xn  -  Tj) 


satisfies (5.1) with $H(x) = x$, $\lambda(x) = (x^2, x)$, $v(x, y) = \max(0, (x - y^2))^{1/2}$ and $s^2 =
\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar X)^2$. The version corresponding to (5.1) under the Poisson scheme is
generally of the type,


t„iX)  = 

denote  the  sample  dispersion  of  Xi, . . .  ,  Xn.  As  in  Theorem  4  of  Babu  and 
Singh  (1984),  we  can  derive: 

Theorem  5.3.  Let 


v(E{\{X i)»  = 

v(X)  =  Je<(Xa)E„e(x), 
and let $L(X_i)$ be a linearly independent subcollection of $(X_i, \lambda(X_i))$ with the property
that all the coordinates of $(X_i, \lambda(X_i))$ can be expressed as linear combinations of
those of $L(X_i)$. If the characteristic function of $L(X_1)$ satisfies Cramér's condition,
$E\|L(X_1)\|^4 < \infty$, and if $m - np$ is bounded, then

$$\sup_a \sqrt{n}\,\big|P(t_n(Y) \le a \mid T_n = m,\ X_1, \ldots, X_n) - P(t_n \le a)\big| \to 0,$$

as $n \to \infty$, for almost all sample sequences $\{X_j\}$.